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Abstract. For many complex networks present in nature only a single instance, usually of large size, is 
available. Any measurement made on this single instance cannot be repeated on different realizations. In 
order to detect significant patterns in a real-world network it is therefore crucial to compare the measured 
results with a null model counterpart. Here we focus on dense and weighted networks, proposing a suitable 
null model and studying the behaviour of the degree correlations as measured by the rich-club coefficient. 
Our method solves an existing problem with the randomization of dense unweighted graphs, and at the same 
time represents a generalization of the rich-club coefficient to weighted networks which is complementary 
to other recently proposed ones. 

PACS. 89. 75. He Networks and genealogical trees - 89.75.Fb Structures and organization in complex 
systems 

1 Introduction in nature [TJ[2J- One of the most important local quantities 

is the number of connections of vertex i, called the degree 
Networks, identified by the set of connections (edges, or h- Real complex networks are characterised by specific 
links) between units (vertices) of a system, are widespread features such as small- world effect [3] and scale-invariant 
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degree distribution [4]. At the same time for many net- 
works (some maps of the Internet [5] , or the whole WWW 
[6 J or biological webs [7]) additional information on the 
intensity of connections is available. In this case, one can 
assign a weight Wij to the link between vertices i and j. 
The weighted counterpart of the degree is given by the 
strength s l = £\ toy . 

An important quantity, which is the main focus of the 
present work, is the rich-club coefficient (RCC) 4>{k) in- 
troduced as a correlation measure of the interconnectivity 
between nodes with a "large" degree [SJ. In particular, 
given the number E >k of edges between the n > k vertices 
whose degree is larger than k the RCC is defined as 
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In other words the RCC <fi(k) measures the probability 
that the edges between pairs of vertices whose degree is 
larger than k are actually drawn. If no edge is present then 
<p(k) = 0, while if all the possible edges are present then 
4>{k) = 1. The interest for this quantity comes from the 
fact that an increasing trend for the RCC would reveal the 
presence of correlations between large hubs. However it is 
easy to check that, even without any correlation, it is more 
likely that one edge is shared between two hubs rather 
than between two vertices with a low degree. In particular 
one finds that in an uncorrelated graph the RCC increases 
as k 2 for large k [9j. To overcome this problem, Colizza 
et al. [H] considered a suitable set of randomized versions 
of a given graph, and computed the corresponding RCC 
<f) rand (k) on this ensemble. They proposed to consider the 



in order to assess the presence of correlations in a variety 
of systems. The randomizing strategy adopted preserves 
the degree sequence of the original graph by using an algo- 
rithm proposed by Maslov, Sneppen and Zaliznyak (MSZ) 
[lOllllj . Two edges are selected, and one end vertex of the 
first is exchanged with one endvertex of the second and 
viceversa, as shown in Fig[T£i. This local rewiring is suc- 
cessful only if no edge already exists in place of the rewired 
ones. Otherwise, two new edges are selected and a new 
rewiring is attempted. 

This rewiring procedure presents some limitations in 
the case of dense graphs. Consider the limit case of a com- 
plete graph with N vertices: here no rewiring is possible 
and the randomized set of graphs corresponds to the sin- 
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Fig. 1. (a) The rewiring in the MSZ algorithm. Note that no 
other links are present between the vertices, (b) The rewiring 
in our new procedure transfers the weight also between exist- 
ing links. In unweighted networks this implies the creation of 
multiple links. Note that if wi = w m this would correspond to 
the removal of the link between A and B. 
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gle original instance. Therefore one always finds a value cient by explicitly transforming the graphs analysed into 
p(k) = 1 for every k, even if only the single value k = N — 1 weighted graphs. Our proposal is then to consider the net- 
is present in the network. However, this merely reflects the work as weighted, and redistribute the weights of the edges 
fact that the configuration space of the possible random- (possibly also on existing links) preserving the strength of 
izations is reduced to only one configuration. For dense the vertices as follows. Specifically, if w m is the minimum 
graphs, such as the World Trade Web [T2 | |13 1 ll4] we con- value of the weights in the original network we can mod- 
sider later on and some food webs [7] , the number of avail- ify the rewiring procedure by changing by an amount w m 
able configuration is also small. Furthermore, note that the weights of two pairs of edges as shown in FigfTJa. This 
most collaboration networks [2] (co-authorship, board of randomization (denoted hereafter as NEW) corresponds 
directors, movie actors) are obtained as projections of a to the original MSZ algorithm when the network is sparse 
bipartite graph. For example in co-authorship networks and unweighted. Indeed in this case the minimum weight 
there are two classes of vertices: the papers and the sci- w m is one and it is unlikely to draw twice the same edge, 
entists authoring them. Links only exist between vertices We then make a second generalization by defining a rich- 
belonging to different classes, and the graph is said to be club coefficient for the weighted case as 

bipartite. From this bipartite graph one can obtain a one- 2E >S 

^ {s) ^ n >s (n >s -iy (3) 

mode network whose vertices represent only the scientists, 

and all scientists co-authoring at least one paper are con- 
nected in a clique (fully connected subgraph). Thus, even 
if the network is on average sparse, it is divided into dense 



where E >s is the number of weighted edges between ver- 
tices whose strength is larger than s and n >s is the number 
of vertices whose strength is larger than s. We finally define 



cliques. In all these situations, any pair of edges belong- p w (s) = (4) 



ing to the most important subunits (the cliques) cannot be 



/ired, and the ratio p MSZ (k) = p(k) 



0(fc) 



rewirea, and tne ratio p [ k ) = p[k) = , h iszii.\ remains 



<t>% EW (s) 
We stress that our focus here is different from other recent 

generalizations of the rich-club coefficient, which are spe- 
close to 1 and cannot be used to assess the significance of 

cific to weighted networks [TMTo] . Our approach originates 
the results. 

from the aforementioned problem with the randomization 

of dense unweighted networks. The above definition, when 
2 New randomization and definition for the coupled with our new randomization procedure, solves this 

i-irh-rhih rnpffiripnt problem even for complete graphs. Moreover, it automat- 

ically recovers the original (and intuitive) MSZ approach 
To overcome this problem we propose to generalize the for sparse unweighted networks. Additionally, it provides 
MSZ algorithm and the definition of the rich-club coeffi- a coherent generalization of the rich-club to weighted net- 
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works, emphasising patterns which are complementary to weighted properties, including the weighted clustering co- 

those investigated by other proposals |15[fT6] , In our above efficient, the weighted average nearest neighbour degree 

definition we are considering only the numbers E >s of and the disparity, display a kind of Bosonic behaviour that 

edges and n >s of vertices, and not their weight. This cap- parallels the Fermionic one observed for unweighted net- 

tures a structural property of many social and/or eco- works with preserved degree |19j . In the present study we 

nomic networks where only the presence of a tie, even if are interested in understanding how the rich-club coeffi- 

weak [17] , plays a role in the organization of the system. If cient of the original network relates to that of the ran- 

combined with the other available definitions, this prop- domized variants. 

erty allows to recover a richer picture of the correlation ., ., ,, Al ,...,. 

J Also note that, among the other randomization dis- 

structure of weighted networks, as we show below for a , . ,, ,., . , , , ,, 

cussed m the literature, one m particular showed that 

particular example. Finally, the choice of the minimum • i _i ^ i 1 ^ 1 1 •. 

J preserving both the degree sequence and the modularity 

value w m automatically sets a scale for the weights in the . . , , , . ,-, , , . r , , 

J ° is m most cases enough to explain the correlations of the 

randomized ensemble. In this way we preserve the original , 177771 ^ ,, , , , . , , 

J r ° network [20] . Due to the close relation between degree cor- 

strengths without the need to regard each weighted edge , , . jj.i_-i.ii- cc ■ _ a 1 

000 relations and the rich-club coefficient, we expect that such 

as a superposition of binary edges and split them as in j • • u 4.u • u 1 u n /-> 

J or- a ran domization would preserve the rich-club as well. Our 

other approaches 115 . Indeed, the latter procedure is very . , . , , ,, 

1 — ' perspective is however in some sense reversed: we shall re- 
sensitive to the (arbitrary) choice of the unit of weight ITS]. , , , , •••■_. 

v ' l — ' gard dense subgraphs as arising m a bottom-up manner 

Remarkably, through a single definition we can address all r . , , , . , il ,, r . , , . , 

•" ° ° from the local constraints, rather than as forcing the nch- 

the above problems and unambiguously characterize any , , , . , r , . n ,, , , 

J club to emerge in a top-down fashion from the modular- 
network, ranging from unweighted to weighted, and from T , . , , . ... ,, , / .., 

00 ° ° lty. In particular, we expect initially dense (either topo- 

sparse to dense. in . ., , >, ii;j-i, j 

logically or m a weighted sense) modules to dilute, and 

vr , , •, , • j , 1 • 1 a possible reordering of vertices among modules for each 

JN ote that m our randomization strong links present in * ° ° 

,, .... . , ... , . . . randomized variant, 
the original network can be diluted into more and weaker 

links in the randomized ensemble. However, the strength In the same spirit, our method is different from ran- 

sequence {si]^ =1: representing the vector of the strengths domizations that step back to the bipartite structure of 

of all vertices, is preserved at any instance. As a result, collaboration or affiliation networks, rewire links preserv- 

it is possible to exploit the results for randomized net- ing the degrees in the bipartite graph, and then project the 

works with preserved strength sequence [19] to character- latter to obtain a randomized one-mode network. In both 

ize how a particular strength distribution affects the prop- methods, cliques that are present in the original network 

erties of the randomized networks. The main lesson is that will not be preserved. However, the bipartite randomiza- 
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Fig. 2. The 10 configurations arising from the randomization 
of the complete graph K 4 . In all configurations the strength 
s of each vertex is preserved. However, for s — 0, 1, 2 one has 
£ >s = 6 in A, while E >s — 4 in B-G, and E >s — 2 in H-J. 

tion will create other cliques in any variant of the network, 
arising as the two-mode network is projected onto the one- 
mode one. 

3 Sparse and dense unweighted graphs 

The simplest example to describe our procedure is to start 
from a simple unweighted graph, where w m = 1. If the 
network is sparse the procedure is the analogous of the 
MSZ one. By contrast, dense unweighted graphs give rise 
to the creation of multiple edges. For instance, we illus- 
trate the possible outcomes for a complete graph with 4 
vertices (K ). Starting from the original configuration A 
shown in Fig[2j the flipping of edges e and £ will produce 
either the configurations B or D. If different edges are 
selected, we find that starting from A we can reach also 
the configurations C, E, F, G. Similarly, starting from 
the configuration H, we can flip any of the edges a,/3,7 
once with any of the edges 6, e, (. In one half of the cases 
we move to the configuration D, in the other half to the 
configuration E. Therefore phd = Phe = 1/2- One can 



T = 



repeat this calculation for any other starting configuration 
keeping track of the various probabilities. In this way we 
can write the following transition matrix T: 

I 2/5 2/5 2/5 2/5 2/5 2/5 \ 
1/6 1/10 2/5 1/2 
1/6 1/10 2/5 1/2 
1/6 2/5 1/10 1/2 
1/6 1/10 2/5 1/2 
1/6 2/5 1/10 1/2 
1/6 2/5 1/10 1/2 
1/10 1/10 
1/10 1/10 

\ 1/10 1/10 ) 

Iterating T yields the steady state matrix T°°, whose 
columns are all equal to each other and represent the 
vector of densities of the various configurations. We find 
N(A) = 4/15, N(B) = N{C) = N(D) = 7V(E) = N(F) = 
N(G) = l/9,iV(H) = N(I) = N(J) = 1/45. Note that 
in this case the strength is always an integer number, 
corresponding to multiple links (w m = 1). In particular, 
the strength of every vertex remains equal to the original 
value 3. However, the E >s in our definition (J3J) counts all 
multiple edges created by the randomization as a single 
weighted edge. Therefore, while in the original instance 
0mi(s) = 4>{k) = 1 for every value of s and k, across the 
randomized ensemble we obtain a (f>^, EW (s) = 11/15 ~ 
0.73 for s = 0, 1,2 smaller than the value c/) MSZ (k) = 1. 
This yields p w {s) — 15/11 ~ 1.36. In general, for complete 
graphs our procedure yields values of p w (s) systematically 
larger than the value 1, correctly indicating the presence 
of the rich-club effect. 

In order to investigate the rich club effect in the whole 
range between the extreme examples of sparse and fully 



V. Zlatic et al.: On the rich-club effect in dense and weighted networks 



connected networks, we consider (Gilbert) Random Graphs 
[2T1I22] corresponding to different values of the probability 
p to draw an edge. We consider the range from p = 10~ 3 to 
1/2. We study the difference between our method and the 
traditional unweighted one by considering the ratio of the 
two densities p w (s)/p M (k) computed with the present 
randomization and with the original MSZ algorithm. As 
previously noticed, when randomizing an originally un- 
weighted graph the strength s is given by the number of 
multiple edges that may be created in it. However, the 
original random graph has no such multiple edges, thus 
for the initial instance k — s and <f> w (s) = 4>{k). We can 

then write 

P w (s) = <f> MSZ ( S ) ,. 

which is independent of the original instance. For MSZ 
the argument s always coincides with the original degree 
k which is preserved, while this is not the case for our 
definitions. The behaviour of the above ratio is shown in 
Fig[5J for various random graphs. In the main panel we 
show different plots corresponding to different choices of 
p. The trends are all characterized by a flat plateau whose 
height is larger than one. Therefore we find that our ap- 
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proach systematically yields 



MSZ > iNEW 



, correspond- 



ing to p w < p MSZ . This effect becomes more pronounced 
as the density increases. This is clearly illustrated in the 
inset, where the height of the plateau shown in the main 
panel is now plotted as a function of p. As expected, for 
sparse graphs the ratio is close to one, since our gener- 
alized approach recovers the MSZ one. By contrast, for 
denser networks we correctly find that our algorithm ex- 



Fig. 3. Rich-club analysis of random graphs. For sparse net- 
works the ratio <f) M /<f> w is close to 1 and therefore our 
proposed randomization has the same desired properties as 
the MSZ procedure. When the density increases, <j> w becomes 
smaller than 4> MSZ , meaning that p MSZ > p w . This is cor- 
rect since in the limit of a complete graph the MSZ procedure 
can only yield <j> M — <j>, as no randomization is possible. 
However, this is the configuration for which one expects the 
maximum value of the rich-club coefficient. 

plores a larger portion of configuration space, confirming 
the results discussed for the extreme example of K A . 

4 Weighted graphs 

Finally, wc illustrate our approach on inherently weighted 
networks. The example we consider is that of the World 
Trade Web (WTW) [1211151 113] . In this network the ver- 
tices are world countries, and a link represents a recipro- 
cal trade relationship whose weight is given in millions of 
dollars. The data report trading exchanges between world 
countries for each year starting from 1948 to 2000 |23j . 
The values of the weights span six orders of magnitude; 
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for this reason it is more convenient to redefine the weight 
as the logarithm of the original trade volume in dollars. 
In the mail panel of FigH] we present the rich-club analy- 
sis on different snapshots of the weighted WTW. The first 
immediate result is that p MSZ {k) sometimes indicates an- 
ticorrelation between hubs. However, since in this region 
the number of configurations accessible by MSZ is really 
small, we interpret this feature as fluctuations in the rich- 
club coefficient given by the poor statistics. In the case 
of p w (s) obtained with our algorithm, we instead identify 
a positive correlation between medium and large hubs as 
confirmed by a series of studies on the wealth and devel- 
opment of nations 23,24]. This detects a significant cor- 
relation for countries with an intermediate strength value. 
Interestingly, this correlated region moves towards larger 
strength values as time proceeds from I960 to 1990. Fur- 
thermore we note that we can detect additional informa- 
tion, as compared with the MSZ algorithm, even if we 
ignore the information about the weights of the WTW. 
Indeed, if we discard the weights and simply consider the 
WTW as an unweighted network, then we can apply our 
strategy as already illustrated in the previous unweighted 
examples. For the year 1970, this is shown in the inset of 
FigHl The original network has a connectance (number 
of actual edges divided by the number of possible ones) 
approximately equal to the very large value 0.5. As usual, 
when using the traditional MSZ algorithm we obtain a 
constant value of p M (k) close to one, mostly due to the 
small number of available random configurations. How- 
ever, using our measure p w (s) we observe a decreasing 
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Fig. 4. Main panel: our weighted rich-club coefficient p w , com- 
pared with the unweighted one p , for 4 different years of 
WTW when the network is considered weighted. Inset: same 
analysis on the unweighted version of the WTW for the year 
1970. Other possible comparisons show similar discrepancies 
between the two procedures. 

trend towards 1. This partially recovers the information 
on the positively correlated range for small and intermedi- 
ate strength values. For larger values we find p w — 1, and 
no particular correlation can be extracted from the data. 
In this case, however, we can exclude that this feature is 
originated by the collapse of the available configuration 
space for randomized configurations. 

5 Conclusions 

In conclusion, we have presented here a novel way to de- 
fine a suitable randomization for dense and weighted net- 
works. Various null models for weighted graphs have al- 
ready been introduced ^5lll6ill4il25H26l[27] . depending on 
the properties one is interested in. Our emphasis here is 
on a problem arising when dense unweighted graphs are 
randomized while keeping the topological degrees fixed. In 
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this case the collapse of the available configuration space 9. V. Colizza, A. Flammini, M.-A. Serrano A. Vespignani, 

has undesired effects on the rich club coefficient. We have Nature Physics 2, 110-115 (2006). 

therefore defined a procedure that solves this problem and 10. S. Maslov, K. Sneppen, Science 296, 910-913 (2002). 

at the same time provides a generalization of the rich-club 11. S. Maslov, K. Sneppen A. Zaliznyak, Physica A 333, 529- 

property which is complementary to other recently pro- 540 (2004). 

posed ones [TSlfTB] . We have shown the advantage of our 12. M-A. Serrano M. Bogufia, Physical Review E 68, 015101 

approach on unweighted graphs ranging from sparse to (2003). 

dense, and even complete. We have also shown its appli- 13 - D. Garlaschelli, M.I. Loffredo, Physical Review Letters 93 

cation to the real World Trade Web, both in its weighted 188701 (2004). 

and unweighted version. Our method outperforms the un- 14 - K - Bhattacharya, G. Mukherjee, J. Saramaki, K. Kaski 

weighted approach and recovers significant information on S ' S ' Manna ' J ' Stat ' Mech P02002 ( 2008 ) ' 

the correlation structure of this network, which is in ac- 15 ' M - A ' Serrano , l arXiv:0802.3122 >l 

cordance with independent analyses of it. 16 ' T ' °P sah1 ' V ' Colizza ' R Panzarasa J- J- Ramasco, 
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